Add Nvidia inference specification #5794

Jan-Kazlouski-elastic · 2025-12-05T10:55:28Z

This PR adds changes to specification caused by elastic/elasticsearch#132388

Additional actions

Signed the CLA
Executed make contrib

github-actions · 2025-12-05T10:59:56Z

Following you can find the validation changes against the target branch for the API.

API	Status	Request	Response
`inference.put_nvidia`	➕ ⚪	Missing test	Missing test

You can validate this API yourself by using the make validate target.

# Conflicts: # output/openapi/elasticsearch-openapi.json # output/openapi/elasticsearch-serverless-openapi.json # output/schema/schema.json

DonalEvans · 2025-12-12T00:06:03Z

package.json

  },
  "dependencies": {
-    "@redocly/cli": "^1.34.5"
+    "@redocly/cli": "^1.34.6"


I don't think this should be getting changed here.

DonalEvans · 2025-12-12T00:35:08Z

specification/_json_spec/inference.put_nvidia.json

+                "rerank",
+                "text_embedding",
+                "completion",
+                "chat_completion"


Nitpick, but could these be in alphabetical order?

DonalEvans · 2025-12-12T00:41:01Z

specification/inference/_types/CommonTypes.ts

+   */
+  model_id: string
+  /**
+   * For a `text_embedding` task, the maximum number of tokens per input before chunking occurs.


This should be "For a `text_embedding` task, the maximum number of tokens per input. Inputs exceeding this value are truncated prior to sending to the Nvidia API."

This is wrong almost everywhere in the docs; there's an issue describing some of the problems with max_input_tokens.

DonalEvans · 2025-12-12T00:41:41Z

specification/inference/_types/CommonTypes.ts

+  text_embedding,
+  completion,
+  chat_completion,
+  rerank


For consistency, could these be in alphabetical order?

DonalEvans · 2025-12-12T00:49:13Z

specification/inference/_types/CommonTypes.ts

+   */
+  input_type?: NvidiaInputType
+  /**
+   * For a `text_embedding` task, the method to handle inputs longer than the maximum token length.


To help differentiate this from max_input_tokens it might be better to word it like "the method used by the Nvidia model to handle inputs longer than..."

DonalEvans · 2025-12-12T00:54:58Z

specification/inference/_types/CommonTypes.ts

+  /**
+   * The URL of the Nvidia model endpoint.
+   */


Would it be helpful to include the default URLs for each task type if url isn't specified?

DonalEvans · 2025-12-12T00:59:38Z

specification/inference/_types/TaskType.ts

+  text_embedding,
+  chat_completion,
+  completion,
+  rerank


For consistency, could these be in alphabetical order?

Add Nvidia inference specification

f9ac65a

Jan-Kazlouski-elastic assigned DonalEvans Dec 5, 2025

Jan-Kazlouski-elastic requested a review from a team as a code owner December 5, 2025 10:55

Jan-Kazlouski-elastic added specification ml skip-backport This pull request should not be backported Team:ML labels Dec 5, 2025

Jan-Kazlouski-elastic requested a review from DonalEvans December 5, 2025 11:04

Merge remote-tracking branch 'origin/main' into nvidia-integration

3b8f3a5

# Conflicts: # output/openapi/elasticsearch-openapi.json # output/openapi/elasticsearch-serverless-openapi.json # output/schema/schema.json

DonalEvans reviewed Dec 12, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Nvidia inference specification #5794

Add Nvidia inference specification #5794

Jan-Kazlouski-elastic commented Dec 5, 2025

Uh oh!

github-actions bot commented Dec 5, 2025 •

edited

Loading

Uh oh!

DonalEvans Dec 12, 2025

Uh oh!

DonalEvans Dec 12, 2025

Uh oh!

DonalEvans Dec 12, 2025

Uh oh!

DonalEvans Dec 12, 2025

Uh oh!

DonalEvans Dec 12, 2025

Uh oh!

DonalEvans Dec 12, 2025

Uh oh!

DonalEvans Dec 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add Nvidia inference specification #5794

Are you sure you want to change the base?

Add Nvidia inference specification #5794

Conversation

Jan-Kazlouski-elastic commented Dec 5, 2025

Uh oh!

github-actions bot commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DonalEvans Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

DonalEvans Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

DonalEvans Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

DonalEvans Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

DonalEvans Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

DonalEvans Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

DonalEvans Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

github-actions bot commented Dec 5, 2025 •

edited

Loading